Retrieval of the most relevant facts from data streams joined with slowly evolving dataset published on the Web of Data
نویسنده
چکیده
Finding the most relevant facts among dynamic and heterogeneous data published on the Web of Data is getting a growing attention in recent years. RDF Stream Processing (RSP) engines offer a baseline solution to integrate and process streaming data with data distributed on the Web. Unfortunately, the time to access and fetch the distributed data can be so high to put the RSP engine at risk of losing reactiveness, especially when the distributed data is slowly evolving. State of the art work addressed this problem by proposing an architectural solution that keeps a local replica of the distributed data and a baseline maintenance policy to refresh it over time. This doctoral thesis is investigating advance policies that let RSP engines continuously answer top-k queries, which require to join data streams with slowly evolving datasets published on the Web of Data, without violating the reactiveness constrains imposed by the users. In particular, it proposes policies that focus on freshing only the data in the replica that contributes to the correctness of the top-k results.
منابع مشابه
Prioritize the ordering of URL queue in Focused crawler
The enormous growth of the World Wide Web in recent years has made it necessary to perform resource discovery efficiently. For a crawler it is not an simple task to download the domain specific web pages. This unfocused approach often shows undesired results. Therefore, several new ideas have been proposed, among them a key technique is focused crawling which is able to crawl particular topical...
متن کاملEffective Learning to Rank Persian Web Content
Persian language is one of the most widely used languages in the Web environment. Hence, the Persian Web includes invaluable information that is required to be retrieved effectively. Similar to other languages, ranking algorithms for the Persian Web content, deal with different challenges, such as applicability issues in real-world situations as well as the lack of user modeling. CF-Rank, as a ...
متن کاملComparison of Information Retrieval Capabilities in Library Software of Payam, Voyager and Aleph
The purpose of this study was comparing Information Retrieval Capabilities in Web-based Library Software of Payam, with Voyager and ALEPH. A checklist designed and included six main trait for evaluation and comparing 73 scales. Data collected by experts' observing of the software's OPAC. Data analyzed by the descriptive statistics methods. Findings shows the preferences in search capabilities i...
متن کاملAn Analysis of Dialogism in Mikhail Bakhtin’s Thought: Convergence of Philosophy and Methodology
Undoubtedly, the twentieth century can be regarded as one of the richest periods of the history of philosophy and thought which globalized this tradition, generally, because of the spread of mass media and even the published books, and joined all vast and narrow streams, here and there, together and at their Juncture a big sea is formed which is the most important gain of the century. One of t...
متن کاملClinical Pharmacology of the Antimalarial Quinine in Children
Quinine is the best studied drug for treating severe malaria in very young children. Quinine may be administered in pregnancy and, at therapeutic doses, malformations have not been reported. Some strains of quinine from Southeast Asia and South America have become resistant. Quinine is the treatment of choice for the drug-resistant severe Plasmodium falciparum. The antimalarial mechanism of qui...
متن کامل